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Disentangling correlated scatter in cluster mass 
measurements 
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ABSTRACT 

The challenge of obtaining galaxy cluster masses is increasingly being addressed by 
multiwavelength measurements. As scatters in measured cluster masses are often 
sourced by properties of or around the clusters themselves, correlations between mass 
scatters are frequent and can be significant, with consequences for errors on mass es- 
timates obtained both directly and via stacking. Using a high resolution 250h~ 1 Mpc 
side N-body simulation, combined with proxies for observational cluster mass measure- 
ments, we obtain mass scatter correlations and covariances for 243 individual clusters 
along ~ 96 lines of sight each, both separately and together. Many of these scatters are 
quite large and highly correlated. We use principal component analysis (PCA) to char- 
acterize scatter trends and variations between clusters. PCA identifies combinations 
of scatters, or variations more generally, which are uncorrelated or non-covariant. The 
PCA combination of mass measurement techniques which dominates the mass scatter 
is similar for many clusters, and this combination is often present in a large amount 
when viewing the cluster along its long axis. We also correlate cluster mass scatter, 
environmental and intrinsic properties, and use PCA to find shared trends between 
these. For example, if the average measured richness, velocity dispersion and Comp- 
ton decrement mass for a cluster along many lines of sight are high relative to its true 
mass, in our simulation the cluster's mass measurement scatters around this average 
are also high, its sphericity is high, and its triaxiality is low. 

Our analysis is based upon estimated mass distributions for fixed true mass. Ex- 
tensions to observational data would require further calibration from numerical simu- 
lations, tuned to specific observational survey selection functions and systematics. 

Key words: Cosmology: theory, large-scale structure of the Universe 



1 INTRODUCTION 

Although there is no question that galaxy clusters are the 
most massive virialized objects in the Universe, identifying 
the mass of any particular cluster remains a challenge. A 
cluster's mass is however one of its core properties, impor- 
tant for using cluster samples statistically to constrain cos- 
mological parameters, for understanding clusters as hosts 
for galaxy evolution, and for studying the growth and 
other p roper ti es of t he cluster s them s elves. Recent r eview s 



include IVoitl J2005ln iBorganil (|2008h : iMvers et all (|2009h : 



lAllen. Evrard fc Mantj |2011 

In simulations, the cluster mass is the sum of the masses 
of the simulation particles which are cluster members, for 
whatever cluster member (and thus mass) definition is used 
(comp a risons of so m e mass definitio n s are f ound in, e.g., 
IWhitd (1200 lj, l2002h ; iHu fe Kravtsovl (|2003l ); ICuesta et all 
(|2008h : iLukic et all (|2009l )). From a given cosmological set 
of parameters, simulations predict well defined and directly 



measurable masses, accurate to the extent that the simula- 
tion captures the required physics and has the requisite reso- 
lution. These theoretical mass definitions cannot be directly 
applied to observational data, as observations instead mea- 
sure properties of galaxies both within the cluster and near 
the cluster, or gas within and sometimes near the cluster, 
or the bending of space by mass in and around the cluster. 
These observational mass proxies are converted, via physical 
modeling and assumptions, to mass measurements for com- 
parison with theory. Improvements in the noisy mappings 
between observationally accessible cluster properties and 
theoretically calculable cluster properties is much sought af- 
ter by both theorists and observers. Complications for ob- 
servations include projection (and more generally the lack of 
three dimensional informatiorp) and reliance of the mapping 



1 There is a long histor y, e.g. for optical cluster richness 
starting with lAbelll il958T ) and continuing with, for example, 
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between mass and observational proxy upon simplifying as- 
sumptions such as hydrostatic equilibrium. The simulation 
based theoretical approaches, for their part, find it challeng- 
ing to capture the directly observable baryonic physics, in- 
cluding galaxy properties. 

In order to alleviate systematics and reduce errors 
in observationally obtained cluster masses, it is becom- 
ing common to combine measurements from different (of- 
ten multiwavelength) observational techniques. The advan- 
tages of complementary information and crosschecks are un- 
fortunately mitigated by the fact that scatters from dif- 
ferent observational methods are often correlated. Essen- 
tially, as physical properties of the clusters themselves and 
their environments are often the causes of mass measure- 
ment scatters, more than one measurement technique can 
be affected. It is important include these correlations in 
order to properly estimate the errors in mass measure- 
ments of any individual cluster and to avoid a bias when 
stacking clusters on one property a nd m ea suring another. 
( Tor di scussion see [Rvkoff et all d2008T); fCohn fc Whitel 
fUl); IStanek et all l|2010h : IWhite. Cohn fc Smitl ^Olot T 
additional simulated examples of correla ted scatters using 
differe n t observatio n al me thods include iMeneghetti et all 
(2010); iRasia et alj (I2012T) : analyse s are beginnin g to in- 



clude these, e .g. iRozo et all ( |200Sh : iMantz et all |2010h ; 



iBenson et al.l (|2011 ).). A recent application to an observa- 
tional cluster sample, re solving some question s raised by ear- 
lier analyses is found in lAngulo et alj (|2012l ).) 

Here we consider multiwavelength mass measurements 
for clusters "observed" in a cosmological dark matter simu- 
lation. Our primary focus is on mass scatters for individual 
clusters viewed along several different lines of sight. We mea- 
sure and characterize the multiwavelength correlations and 
covariances, and study their relation to other cluster prop- 
erties using both correlations and PCA, principal compo- 
nent analysis. This extends recent work using PCA to com- 
pare relationships betwee n cluster (|Skibba fc Maccioll201ll ; 
Ijeeson-Daniel et al.ll201lT ) properties such as concentration, 
mass and elli pticity in simulations, and some supercluster 
counterparts. [Einasto et alj (|201ll . [2012T ). in observations^ 

The mass observables we simulate are red galaxy rich- 
ness, phase space richness, velocity dispersions, Sunyaev- 



Dalton et alj dl992f); 


Lumsden et al.l dl992h; Ivan Haarlem et al. 


dl997l); White et alj 


19991); Cohn. Evrard. White et al. (2007t); 


Cohn & White! d200£ 


); IRozo et alj d201ll); iBiesiadzinski et al. 


(2012 


), for cluster weak lcnsing (e.g., Rcblinsky & Bartclmann 


(1999 


); iMctzlcr. White & LokerJ d200ll); iHoekstral 


2001); 


de Putter & White 


(2005 


); IMeneghetti et al.l 


2010); 


Becker & Kravtsov 


d201lh ; 


Hoekstra et alj d201ll)), for clus- 



ter Sunyaev-Zel'dovich 
(SZ) flux measurements 



2002); 



19721. 1198' 
(e.g.. IWhite. Hernauist fc Springe' 
iHolder. McCarthy fcBabull d2007l); lHallman et al 
' 20081) 



20071) ; IShaw. Holder fc Bodd d200; 
and for cluster velocity dispersions 



lAngulo et al 



Tormerj dl997l); iKasun 



Con 



2012)) 



20051); iBiviano et alj d2006ft 



1997) 



.. Evran . . 

White. Cohn fc SmhJ d2Q10T ); ISaro et al.U2012T) ) 
2 Comparing cluster mass scatters to physical clus- 
ter properties has a long his tory, rece nt studies include 
lYang 1 Bhattach arva fc Ricked d20ip|); iBecker fc Kravtsovl 
d201ll); iBattaglia. Bond. Pfrommer fc Sieversl feOllft ; 

iBahe. McCarthy fc Kin3 d2012l) . as well as papers mentioned 
above. 



Zel'dovich decremen t and wea k lensing £ statistic 
( Fahl man et al. 1 11994 iKaiserl Il995f ), techniques m use or 
planned for large volume current and upcoming cluster sur- 
veys such as Atacama Cosmology Telescope (ACT0), South 
Pole Telescope (SPlQ), Blanco Cosmology Survey (BCSj|), 
Dark Energy Survey (DES3) and Large Synoptic Survey 
Telescope (LSSlQ). 

The mock simulation measurements, their scatters, 
and general information about PCA are in §2. (Much of 
§2 summarizes work on th e same simulation detailed in 
I White. Cohn fc Smitl d2010h (h ereaft er WCS), an d further 
studied in iNoh fc Cohn! (|201lh and ICohnl d201ll U In §3, 
mass scatters for each cluster, along ~ 96 lines of sight, 
are correlated, and their covariances are analyzed via PCA. 
Distributions of the scatter properties are considered, and, 
cluster by cluster, the PCA direction of largest scatters is 
compared to special physical cluster directions. In §4 cluster 
properties, including the individual cluster mass scatter dis- 
tributions, and environmental and intrinsic properties, are 
intercompared using correlations and PCA. In §5 PCA is 
instead applied to mass scatter for the whole sample of clus- 
ters at once, to analyze scatter including both line of sight 
and cluster-to-cluster variation, with some discussion of pos- 
sible extensions to observations. §6 discusses outliers and §7 
summarizes. 

While w e were preparing this work for publication, 
lAngulo et al.l d2012l ) appeared. They consider correlated 
mass scatter in multiwavelength measurements, in a 4.1 Gpc 
side simulation which also includes X-ray. 



2 SIMULATIONS AND METHODS 
2.1 N-body data 

Our simulation data are the outputs of an N-body simula- 
tion of M. W hite, described in detail in WCS. His TreePM 
dWhitdl2003 ) code was run with 2048 3 particles in a pe- 
riodic box with side length 250 h~ l Mpc. The 45 outputs 
are equally spaced in ln(a) from z — 10 to z — 0. Cos- 
mological parameters were taken to be (h, n, f2m, fg) = 
(0.7, 0.95, 0.274, 0.8), consistent with a large number of cos- 
mological observations. We focus here on mock observations 
at z = 0.1, where our methods have been most closely tuned 
to and tested with observational data, as repor ted in WCS. 
Halo s are found via Friends of Friends (FoF) dDavis et al.l 
Il985l ). with linking length b = 0.168 times the mean inter- 
particle spacing (connecting regions with density at least 
roughly 100 times the mean background density). Clusters 
are halos with FoF masses M > 10 14 /i -1 M o (M hereon will 
mean this b = 0.168 FoF mass, we will also write this as 
Mtrue when comparing to estimates) . There are 243 clusters 
in the box. Note that because we have a periodic box we 
do not need to worry about clusters located near the edge, 
similarly, because we are using FoF as a halo finder, every 
particle is uniquely assigned to a single halo. 



www.physics.princeton.edu/act/ 

pole.chicago.edu 

cosmology.uiuc.edu/BCS/ 

www.darkenergysurvey.org 

www.lsst.org/ 
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Galaxies are take n to be resolved subhalos, whic h 
are found via Fof6d l|Diemand, Kuhlen fc Mad"aul [2006), 
with the implementation as descri bed in the appendix of 
WCS. S ubhalos are trac k ed (s ee IWetzel. Cohn fc White! 
(|200gT) : IWetzel fc White! (|2010h for particular details) 
from their infall into their host halos in order to 
assign luminosities via subhalo abundance matching 
l|Conrov. Wechsler fc Kravtsov|[2006h . The resulting galaxy 
catalogue minimum luminosity at z = 0.1 is 0.2L, (again 
see WCS for more discussion and validation tests of the cat- 
alogue galaxy properties with observations). 

2.2 Cluster mass measurements and scatters 

We consider five cluster mass measurement methods with 
this simulation (see WCS for specifics): 

• iVred: Richness using the lKoester et all (|2007l) MaxBCG 
algorithm based upo n colors. Galaxy c olors are assigned us- 
in g the algorithm of l&kibba fc Shethl (I2009T) with evolution 



of IConrov, Gunn fc White! (120091 ): IConrov. White, fc Gunnl 
l|2010l) : IConrov. fc Gunnl (boich . Galaxies are taken to be 
"red" if they have g — r within 0.05 of the peak of the red 
galaxy g — r distribution specified bv lSkibba fc Shethl (|2009h 
for their observed Mr, again see WCS for more detail. 

• iVph: Richness based upon spectroscopy, with 
cluster membership assigned via the criteria of 



lYang. Mo fc van den Boschl (|2007T ). 

• SZ: SZ flux (Compton decrement) is assigned to ev- 
ery particle by giving it a temperature based upon the 
mass of its halo. For every cluster, its measured SZ flux 
is then the flux within an annulus of radius rigoi, (the ra- 
dius within which the average mass is greater than or equal 
to 180 times background density), through the length of 
the box, apodized at the edges. This was shown in e.g. 
IWhite, Hcrnq uist fc Springe! ( 2002) to well approximate hy- 
drodynamic simulation results for SZ at the scales appropri- 
ate for two cluster surveys mentioned earlier, SPT and ACT. 

• Vel: Velocity dispersions calculated via the method de - 
tailed in WCS , and b ased on. den Hart og~fc Katgertl l|l996h : 
iBiviano et"afl |2006h ; IWoitak et al.l (120071 ). Phase space in- 
formation is used to reject outliers and the mass estimate 
includes the harmonic radius (calculated as part of the out- 
lier rejection, more details and definitions in WCS). 

• WL: Weak lensing using a singular isothermal sphere 
(SIS) or NFW model to assume a cluster lens profile 
and then fitting the projected ma ss, using the £ statistic 
jFahlman et al.lll994l ; Kaiserl Il995t ) , in a cylinder with ra- 
dius risob and (apodized) length of the box (again WCS 
describes fitting models, etc.). 

The red galaxy richness, phase space richness and ve- 
locity dispersions measured in our simulations are expected 
to include the majority of systematics that are present in 
real observations. The weak lensing and Compton decre- 
ment (SZ) observations however do not include all known 
systematics, such as miscentering, shape measurement 
and source redshift errors for lensing, and foreground 
and point source removal for Compton decrement. The 
relatively small box size (250 h^Mpc on a side) also 
means that line of sight sc a tter i s underestimated (e.g . , 



lAngulo et all d2012h for SZ and iReblinskv fc Bartelmannl 
Jl999l): iMetzler. White fc Lokenl l200ll): iHoekstral J200ll) ; 
de Putter fc Whitej 
Becker fc Kravt sovJ 



(|2012| ); iHoekstra et al.l 



2005) 
2011 



2011 



Mcncghct ti et all ( 201C 



White. Hernguist fc Springell (120021 ) : lHallman et all (2007) 



Holder. McCarthy fc Babull (|2007l ): ICohn fc White! (2009); 



iBahe. McCarthy fc Kind 
for lensing). 

We extend the cluster sample use d in WCS to a lowe r 
mass r ange , M > 1O 14 /i _1 M , as in iNoh fc Coiml (|201ll ): 
ICohnl (|201lh . The five observables listed above are found 
along 96 lines of sight for each cluster, each time placing the 
cluster at the center of the periodic box. Just as in WCS, 
lines of sight for clusters are removed for all measurement 
methods when a more massive cluster has its center within 
risob along the line of sight (this removes ~ 400 of the orig- 
inal ~23000 lines of sight). In addition, to allow fair inter- 
comparisons, only lines of sight which have reliable mass 
measurements for all methods are included; the ~ 90 lines 
of sight with fewer than 8 galaxies making the cut for a ve- 
locity dispersion estimate, or either richness < 1.1 are also 
removed. These cuts will have some effect on the scatters we 
consider but would be expected to be identifiable observa- 
tionally. 

We take the logarithm of these observables and that of 
the true mass M to find the mean relations for all clusters 
with M > 1O 14 /i _1 M . We find relations for bins of M vs. 
observables, because of the large scatter at low mass. The fits 
are done by throwing out 3 a outliers for three iterations. 
This gives us our map between the observables and mass 
estimates Mjv rcd , A 7 /jv ph , M S z, M Vel , M W l- 

The distribution of the fractional mass scatters, (M es t — 
Mtrue)/Mtf U e, for the five mass measurement methods along 
~ 96 lines of sight for each of the 243 clusters, is shown in 
Fig. fJJ The scatters range from ~ 0.2-0.5, with the small- 
est mass scatters associated with Compton decremenjf] and 
N p h- Note that this work follows a theoretical approach 
where all the mass measurements are related to the known 
true mass, a quantity inaccessible in observations. In partic- 
ular, the sample is cut on the unobservable value of Mt rue 
and the distributions we consider are M es t(Mt ru . e ), not vice 
versa0 For an observational sample based upon some mea- 
surement Me S t, such as richness mass or weak lensing mass, 
often the quantity of interest is the scatter in P(M trU c|M cs t) 
(see also extended comments in Ej5j 



2.3 Filaments and Galaxy Subgroups 

In section 34] clu ster filament properties calculated in 
INoh fc Cohnl |201l| ) and c luster galaxy subgroup properties 
calculated in I Cohnl (|201ll ) for this simulation are used. De- 



8 Recall the caveats for both SZ and weak lensing measurements 
mentioned earlier, however. 

9 We thank E. Rozo for extremely helpful discussions on this. 

10 For example, the richness mass scatter in Fig. [TJ looks 
essentially Gaussian, unlik e the double peaked dist ri butio n 
found and studied in e.g . ICohn. Evrard. White et al.l d2007l) ; 
lErickson. Cunha fc Evrardl j201ll ). If looking at P( Mtr„JM mt ). 
one will find clusters which are "blends", i.e. several halos 
which contribute to one apparent halo, often with a higher M eB t 
than any of the c ontributing halos. These blends (see also e.g. 
iGerke et al.l ||2005| ) for discussion) are the source of the bimodal 
mass distribution reported and used in these other papers. 
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Figure 1. The fractional mass scatter, M es t : i/Mt r u e — 1 for all 243 
clusters along ~96 lines of sight each, for the five mass measure- 
ment methods we consider. Solid lines are the masses estimated 
via Nte&jNpk, SZ (Compton decrement), velocity dispersion and 
WL (weak lensing) as described in section [221 An approximately 
0.2-0.5 standard deviation is found. The (for the most part dif- 
ficult to distinguish) dashed line is a least squares Gaussian fit, 
giving (Tfl t , which is much smaller than the standard deviation 
for SZ. The WL and SZ mass scatter is expected to be strongly 
underestimated compared to true observations, as the 250 Mpc/h 
box size is too small to include all contributions to mass scatter 
along the line of sight to the observer, and in these two cases 
several known systematics are not included as well. See text for 
more information. 



tailed background can be found in those two papers, but we 

briefly summarize some key a spects here. 

Filaments are found in iNoh fc Cohr] j 20111 ) using a 
m odification of the da rk matter halo based filament finder 
of IZhang et all |5ool)E| This filament finder searches for 



Although the cosmic we b w as noted years ago 

llZel'dovich. Einasto fc Sha idarin 1982; Shandarin_fc_Z£Tdovicl] 
' 1984 Bond, Kofman fe Pogosvanl 



19831 ; lEinasto et alj 



19961 ). no unique filament finder exists. A variety of 
finders are in use, based on a wide range of dark mat- 
ter, halo and/or galaxy p roper tie s, including for example 
iBarrow. Bhavsar fe Sonodal dl985T) ; iMecke. Buchert fc Waenerl 



dl994h: 



Is^hmT* Sathvaprakash fc Shan darinl dl99STl : 



ISchmalzing et al.l 19991) 
||2000 | 1 : ISheth et al] l2003ll 

iPimbbletl ll2005allblk 



iNoyik ov. Colom bi fc D ord 
d2007ah; IColberd [I2007T). 
van de Weygaert fe Schaapl ll2007ll 



Colombi, Pogosyan fc Souradecp 
Colberg, Krughoff & Connolly 
IStoica et al.1 1I2OO5P 
2006h : lArag0'n-Calvo eta! 
Hahn et alT ||2007; " 



Saarl |2007| ~ 
Forero-Romero et al.l d2009h: 



Sousbic et al. 



Bond J _Stmuss_& i CJen| 



Pogosvar^et^iL (1200911 ; J Sousbie, Colombi „ 
Stoica. Martinez fc Saarl d2010h: IWu. Batuski. fc Khalill d200l 
Zhang et al. I <200Sh; lAragon-Calvo. Shandarin fc Szalavl 



(201' 



rig c 

IE 




Aragon-Calvo. van deWey gaert^ T*JoliesI 
Bond, Strauss fc Cenl d2010bl'l; iMurphv, Eke fc Frenkl 
Sousbid ll201lfl;IWav. Gazis fc Scargle! (2011); Sh andarin! 
Shandarin, Habib & Hcitmann (2011 
Smith et al.1 d2012T ) " 



; Heitmannl (l201lh : iGenovese et all 
Ijasche fc Wandelj d2012h . 



bridges 10 h~ 1 Mpc or smaller between halos above 3 x 
10 10 /A Mq, starting with the most massive halos as poten- 
tial bridge endpoints. Some clusters (16/243) end up within 
filaments because of the finder, such as less massive clusters 
located between two close (< 10/A Mpc) massive clusters 
and clusters closer than 3h~ 1 Mpc to a larger cluster. The 
rest of the clusters each lie at the center of a filament map 
extending out to 10 h~ 1 Mpc. We found that filaments, halo 
mass (halos with mass > 3 x 10 10 /AMq) and galaxy rich- 
ness all tended to lie in a planar region around each clus- 
ter. We characterized these regions by taking a fiducial 3 
h~ 1 Mpc high disk centered on the cluster which extends 
out to the edge of the radius 10 h~ 1 Mpc sphere. We use the 
planes related to halo and filament mass below. For halo 
mass, we randomly sampled 10,000 orientations to maxi- 
mize the halo mass fraction in the plane (relative to the 
halo mass in the 10 h~ 1 Mpc sphere). For filaments, we 
considered planes spanned by the cluster and pairs of fil- 
ament endpoints, and then took the plane which enclosed 
the most filament mass. This plane was not found for clus- 
ters lying within filaments. See lNoh fc Cohr] (|201ll ) for more 
details. We consider four quantities from this analysis below: 
Apinnt (halo mass fraction in plane relative to that in 10 
/AMq sphere), fM, plane (cluster filament mass fraction in 
plane relative to sphere), and the respective plane normal 
directions n mass , rifii . 

Galaxy subgroups were characterized in ICohnl (l201ll ) 
for this simulation. These are groups of galaxies that fell 
into a cluster as part of a shared halo at an earlier time. 
Within the clusters, they share some coherence in space and 
time which can remain for several Gyr. We will use for each 
cluster its largest (richest) galaxy subgroup, in particular its 
fractional richness relative to that of the cluster, /ii 3ub , its 
displacement relative to the cluster center, divided by the 
cluster long axis /u Sub , and the directions of its position 
and average velocity relative to the cluster center, r sub , v S ub- 



2.4 Principal Component Analysis 

For context and backgr ound, we sum marize PCA and our 
notation here (see, e.g. Ijolliffel l|2002l ) for extensive discus- 
sion). PCA can be used when there are several correlated 
or covariant quantities. It is essentially a rotation of axes 
to find linearly independent bases (i.e. quantities which are 
not covariant or correlated), and is based on a model where 
some underlying average linear relation is present. We will 
apply PCA in a few different contexts. 

Our starting application will be for individual clusters. 
For each individual cluster and line of sight, we have sev- 
eral different methods to estimate the true cluster mass. 
Each line of sight can thus be associated with five numbers, 
where each number is the mass measured in one method. 
These numbers can then be thought of as coordinates in 
some five dimensional abstract space, with each axis in this 
space corresponding to a different measurement method. 
All of the different lines of sight considered together then 
give a cloud of points in this space of measurement meth- 
ods. PCA gives the properties of the "shape" of the mass 
scatters in this space, around their average values for the 
combined observations, for each cluster. We will consider 
these shapes and how they relate to other cluster prop- 
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erties. In addition, correspondences can be found between 
large mass scatters and physical properties or directions of 
the cluster. The PCA direction with smallest mass scatter 
is useful as well. Taking the ensemble of clusters, groups of 
properties which change together can be inferred by using 
PCA on the f ull set of correlations (this la tter approach was 
pionee r ed by Jeeson-Daniel et all f|201 lh ; ISkibba fc Macciol 
i|201ll ); lEinasto et all (|201lD ). 



There are other uses of PCA, and caveats as well. PCA 
is often used to find the minimum set of variables needed to 
describe a system to some accuracy, for instance the dom- 
inant contributing basis vectors composing a galaxy spec- 
trum. As far as caveats go, one concern is that if correlated 
variables are not scattered around a linear relation, a sim- 
ple rotation of basis using PCA will not usefully separate 
them. For this reason, sometimes other functions are used 
besides the variables themselves, e.g. logarithms, when it is 
suspected that variables might be related by power laws. 

For illustration, we take a set hypothetical measure- 
ments for two methods, as shown in Fig. [5] Each pair of 
measurements, by the two methods, is a position, i.e. a dot, 
in this plane labeled by two coordinates. We take one co- 
ordinate to be the shifted mass using red galaxy richness, 



Ml 



t), and the other to be the 



shifted weak lensing mass, My 



= M 



WL,cst 



{My 



Here, a denotes which particular point is being measured 
and the average is over all the points shown, all a values, 
i.e. (M NrBd , e3t ) = w^Ea M N rei , B sf The vector M D Q bs de- 
notes (Mx rt , d , M$fL,) an d N a is the number of measurements 
(points) indexed by a. For our first application below, all 
different values of a pertain to the same cluster, but label 
different lines of sight. The shift by the average over all the 
points (all a) guarantees that (Mwl) = (Mj\r rcd ) = 0. 

Diagonalizing the covariance matrix for (Mjv rcd , Mwl), 
i.e. found by summing over all a, produces orthonormal 
eigenvectors Pd (principal components) with eigenvalues 
A;. The eigenvectors are illustrated in Fig. [2] and are the 
axes of a new coordinate system in the space of measure- 
ment methods in which the measurements have zero covari- 
ance. The Pd can be expressed in terms of the original 
basis directions, 



PCi = j3 Te d, ie re d + /9\VL, i&WL 



(1) 



which identifies the mass scatters contributing most to each 
Pd. For example, if /3 ro( j, o is large, then most of the scat- 
ter in the direction of PC'o also lies in the direction of e re d- 
One also sees how different scatters are related. In this sim- 
ple example the biggest scatters come from increases or de- 
creases in both MrdjMwL simultaneously. This implies, in 
this case, that PCo might be related to an average overall 
mass shift, amongst methods, as well. Note that the overall 
signs for the Pd are arbitrary. 

A point (or vector from origin, labeled by a) in the 
original space can then be rewritten in the basis spanned by 
PC ,PCr. 



M a = M r Q ed e red + M WL e WL = a%PC + a$PGi 



(2) 



For example, the point marked by a star in Fig.[2]has coordi- 
nates M* = 10 14 e rcd + 10 14 e W L = s/2 x 10 14 P"C + PCi. 
That is, (M* cd ,M WL ) = (1,1) x 10 14 (assuming measure- 
ments are in units of h' 1 Mq) and (ao, a±) = (%/2, 0) x 10 14 . 
The variances in each of the new directions, associated 



ewL 



&red 



Figure 2. Principal component analysis takes a set of cor- 
related measurements (using two methods shown as basis ele- 
ments e rod ,eyyL) and rotates their coordinates to a new ba- 
sis, shown as dotted lines, where the measurements are uncor- 
related. For example, the point marked as a star (i.e. a = *) 
is M* = 10 14 e rcd + 10 14 e WL = s/2 x 10 14 PC + P'Ci. The 
generalization to more measurement methods and thus a higher 
dimensional space is immediate. Amongst the orthonormal PCi, 
we choose PCo to be along the direction of largest scatter (i.e. 
to correspond to the largest variance, Ao), PG\ to be along the 
direction of second largest variance, etc., as described in the text. 



with the coefficients af, are the eigenvalues of the princi- 
pal components. Thus, Ao is the eigenvalue associated with 
PCo, etc. For all PCA eigensystems we consider here, we 
will order i < j if Xi > Aj and define A = X^i an d 
FJ A = Yli • Generally, if there are iV met hod measurement 
methods, there are JV mo thod PCi, spanning an iV mc thod di- 
mensional space. In Fig. [2] A mot hod=2; when we consider 
the five different mass measurement methods below, for ex- 
ample, Amcthod = 5. For PCA applied to different cluster 
properties, in H4.3I below, we have iV mct hod = 24. We will 
apply PCA to covariance and correlation matrices, using the 
Pearson covariance Cov(xy) = N 1 _ 1 X^=i( a;< * — &){Va ~ V) 
where x is the average of the N a points x a , etc., and 
Cov(a:, y) / \JCav{x, x)Cov(y, y) its associated correlation, 
for N a measurements F^l 

The sum of the A;, ^ A, is the sum of the variances of 
the measurement methods, their product, FJ A, is related to 
the "volume" in this space of scatters, i.e. how the measure- 
ments are spread out in the space of measurement methods 
(specifically, for the example in Fig. [2] VAoAi is proportional 
to the area of the ellipse) . A small eigenvalue A; means that 
the scatter in the corresponding Pd direction is small, i.e. 
that the volume of scatters is roughly confined to a lower 



For the cluster mass scatters, as this covariance can be af- 
fected by outliers, we experimented with several outlier rejection 
schemes. As the resulting values were somewhat similar, and our 
analysis is in part just to provide an example, we use the untrim- 
mmed Pearson covariances and correlations hereon. We also con- 
sidered for some properties (as did some of the earlier cluster 
PCA work) the Spearman correlation. The Spearman correlation 
coefficient uses the ranking of the measurements rather than the 
raw measurements themselves. Trends were similar to the Pearson 
covariances and correlations. 
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dimension. In particular, if most of the scatter is due to Ao, 
then there is a close to li near relation p r esent . Such lower 
dimensionality was used in lEinasto et al.l 1)20111 ) to come up 
with scaling relations for superclusters. For PCA of correla- 
tions, a large Ao occurs if the initial measurements via dif- 
ferent methods have strong correlations; for covariances, a 
large Ao can also occur if an individual measurement method 
has large scatter. 

Measurement methods M" bs j (in the example 
MAr red ,MwL) with the largest correlation or covariance 
with af (their projection on PCi) can be thought of as 
those dominating the scatter in the direction of PCi. 
This covariance or correlation is not unrelated to the 
measurement method's contribution to PCi, {P hs,j,i in 
Eq. [1]). The covariance or correlation is largest when 
fiohs,i,i is large, an d when the eigenvalue Xi is large 
relative to the other /3 bsj's and Xi. (For PCA on corre- 
lations in particular, (M bs,j a i) I V (^ob S jM obsj }(a 4 ai) ~ 
/3obs ,j,»Ai/ J Xi J2 k ^k/3% hs j th ; if A ,^obsj,o are large and a 
is considered then this approaches 1.) 

To summarize, PCA is a method taking a set of mea- 
surements via different methods which are correlated and 
separating them into uncorrelated combinations. In particu- 
lar, if one denotes each set of measurements as a position in 
some space, along axes corresponding to each type of mea- 
surement method, then PCA is a rotation of coordinates in 
this space. The volume and shape traced out by the points 
representing the measurements are related to the PC eigen- 
vectors and eigenvalues, and the direction in the space cor- 
responding to the largest PC eigenvector is the combination 
of measurement methods with the largest scatter in its dis- 
tribution. 



3 VARIATIONS FOR A SINGLE CLUSTER 
DUE TO LINE OF SIGHT EFFECTS 

As mentioned in the introduction, many of the mass mea- 
surement method scatters in Fig. [T] are comparable in size 
because they are due to similar properties of the cluster or 
its environment. Correlations between the scatters are thus 
expected, and as noted above, these correlations and their 
consequences become increasingly important as multiwave- 
length studies become more common. 

We first consider each cluster and its line of sight mass 
scatters separately. In this way, the "true" object and its 
true mass remained fixed; all variations in scatters are due to 
changes in line of sight. Several examples of correlations for 
these scatters were already noted and illustrated in WC£r^l 
and correlations with various p hysical proper ti es (di s cussed 
below ) were further studied in iNoh fc Cohnl (|201lh : ICohnl 
|201ll ). Here we statistically describe these correlations and 
covariances. We characterize cluster to cluster trends and 
variations in line of sight mass scatters and predicted masses, 
and then apply PCA to these scatters. 



13 An example of correlations between velocity dispersion and 
weak lensing measurements are shown in WCS Fig. 14 
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Figure 3. Cluster mass scatters (M os t/Mt ruo — 1) for one cluster 
of mass 4.8 X 10 14 /i — 1 Mq, along 95 lines of sight. The correlation 
and covariance for each mass measurement method pair is shown 
at the top of each panel (the y-axis is for a larger scale to allow 
room for these numbers). Large correlations are present for many 
pairs of mass measurement methods. 



3.1 Correlated mass scatters for different cluster 
observables 

To give an idea of the correlations and covariances for our 
five different mass measurement methods, we start with an 
example: the 10 pairs of mass measurements for a single clus- 
ter (M = 4.8 x lO 14 fr _1 M ) shown in Fig. H Each panel 
shows a different pair of mass estimates along all lines of 
sight (i.e. N a = 95), and correlations and covariances are 
listed at the top of each. We use M es t /M trU e — 1 to focus on 
fractional mass scatter. As can be seen, many of the corre- 
lations are large. 

For all 243 clusters, the correlations for the same pairs 
of mass measurement methods are compiled in Fig. [4] with 
medians and averages given in Table [1] Strong correlations 
are frequent. For each cluster, at least one pair of mass mea- 
surement methods has correlation > 0.4, and the largest pair 
correlation is often larger, ~ 0.7. Within our cluster sample, 
the mass scatters for (Mjv rod , Mjv ph ) are most often the high- 
est correlated pair. The other measurement method pairs 
which frequently have the highest correlation (but not as 
often as (M/y red , M Nph )) are (M Niad , M S z), (Miv rcd , Mwl), 
(Mjv ph , M sz ) and (M Nvii , Mwl). The pair with the min- 
imum correlation is most frequently (Mvei, Mwl) (closely 
followed by (M Ve i, Msz), and (M Ve i, M Nph ))- 

For our PCA analysis below, we will use covariances 
instead, shown in Fig. \5\ with medians and averages given 
in Table [T] These are more relevant for understanding the 
actual mass scatters and how they change together, rather 
than, for example, how much a relatively large Msz mass 
scatter corresponds to a relatively large Mjv rod mass scatter. 
As the fractional mass fluctuations (the a values in Fig. m 
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Figure 4. Correlations of M os t/Mtru C — 1 for pairs of mass mea- 
surement methods (Mjv red , Mjv ph , Msz; Afyel, Mwl). 
The solid line is the distribution of cluster mass scatter cor- 
relations, for all clusters individually, for the same pairs as in 
Fig. [3] The dotted line corresponds to the 70 clusters with mass 
> 2 X 10 14 /i~ 1 A/q. The vertical dashed lines are at the median 
values which are listed in Table [l] along with the average values. 
Note that the x-axis, the range of correlations, has a scale which 
varies widely between different types of measurement method 
pairs. 
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Table 1. Average and median values for the distribution (for 
243 clusters) of pairs of mass measurement method correlations 
shown in Fig. [4] and covariances shown Fig. [5] 



tend to be about 0.3, the covariance sizes should be com- 
pared to ~0.09. The largest covariances are between (Mjv rBd , 
Mwl and Myei). Each cluster has at least one covariance > 
0.01. In contrast, the minimum covariance between measure- 
ment methods tends to be between the pair (Mjv ph , Msz)E3 

14 If the abovementioned neglected weak lensing and SZ scatter 
is uncorrelated with local properties of the cluster, as expected, 



Figure 5. Covariances of M cs t / Mt ruc — 1 for pairs of mass mea- 
surement methods (Mjv red , Mjv ph , Msz> Myei, Mwl)- 
The solid line shows the distribution of cluster mass scatter covari- 
ances, for all clusters individually, and the same mass observation 
pairs as in Figs. 13 141 The dotted line restricts to the 70 clusters 
with mass > 2 X 10 14 /i _1 Mq. As the characteristic mass scatter 
for most clusters is about 0.3, the range for covariances shown is 
±0.05. The vertical dashed lines are at the median values, which 
are also listed in Table[T] Note that the x-axis, the range of covari- 
ances, has a scale which varies with types of mass measurement 
method pairs. 

Some of these correlation and covariance trends are un- 
derstandable (larger covariances tend to go with quantities 
with larger scatter more generally and vice versa), but oth- 
ers rely upon the interplay between different measurement 
methods and the causes of the scatter. The relative impor- 
tance of different contributions to these were not a priori 
obvious to us, although reasons could be found for trends. 
For instance, the two richnesses often are the most corre- 
lated mass measurement pair. This is perhaps because both 
use the same objects (galaxy counts), so that an enhance- 
ment or decrement of Mjv b (galaxies making the spectro- 
scopic cut) might be more likely accompanied by a similar 
change in Mjv rod (galaxies making the red sequence cut), 
than by changes in the mass measurements by other meth- 
ods. Other mass measurement methods are correlated with 
richness less directly (for Msz and Mwl the measurements 
and thus presumably the scatter are more directly tied to 
the dark matter distribution rather than the biased galaxies; 
Myei just seems to be weakly correlated with most things). 
In section iJ3.3l we compare the PC A results (scatters which 

when this scatter is included the majority of the covariances will 
not change (except possibly the one between weak lensing and 
SZ, as they can possibly have correlated changes in their scatter 
due to structure outside the box). However, the correlations will 
change as they are divided by the correlations of SZ with itself or 
weak lensing with itself. 
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occur together) with physical cluster directions to get some 
idea of which properties might be driving covariant mass 
measurement method scatters. 

3.2 PCA for individual clusters 

We now apply PCA to the covariances for M bs/A4true for 
each individual cluster, to get a new basis, 



Mo- 



il//;: 



il/a 



Mtruc M tTul 



(3) 



The subtracted offset M avc = (M cst ), where the average is 
over all the lines of sight for each cluster of interest, i.e., 
there is a different M ave for each cluster. The median (and 
rms around zero) values for the ensemble of clusters, for 
|Mavo/Mtrue - 1| are [0.21 (0.32), O.lf (0.17), 0.08 (0.15), 
0.11 (0.18), 0.18 (0.27)] for M Nxed , M Nph , M S z, M Vel , M WL 
respectively. The relative sizes of the line of sight scatters 
around the average mass, M ba/A/t ruc — Af avc /Mt ruc com- 
pared to the line of sight averaged mass around the true 
mass Af avo /M truc — 1 varies widely cluster to cluster. Except 
for velocity dispersions, the rms scatters of (A? bs/Aftrue — 
M a ve/M truc ) is > | A/ avc /A/true - l\Joi 50-60 percent of the 
clusters and the median value of |A? obs /M truo — M ave /M tTue \ 
is > |M ave /Mtruc — 1| for about 30-40 percent of the clusters, 
for velocity dispersions the numbers are closer to 90 and 70 
percent respectively. 

As in section i]2.4l and by the definition in Eq. [3l M a b s 
refers to mass measurements which have zero average when 
summed over the sample of interest. Any vector in this space 
can of course be written in terms of the orthonormal basis 
PCi : M', what is special for M a bs is that the variances of 
the a™ are equal to Xi for gaussian scatter. Because there 
are five mass scatters, there are five PC vectors Pd t M per 
cluster, with eigenvalues A^a/, again ordered Xo,m > Xi,m > 
A2,m and so on. We use the subscript M to distinguish these 
PC vectors from others which will be considered in section 
and take M b s ,i, with i = 0,1,2,3,4, to correspond to 
(M NiBd , M Nph , M sz , M Val , M W l) and similarly for e obM . We 
also have, as in Eq. [T] 

4 

PCi t M = /3obs,j,ie bs,j • (4) 

obs,j— 

We first consider the PC eigenvalues, the Ai,M- There 
are some trends: the fractional scatter in the largest direc- 
tion ( Ay ) ~ 0.7, but can vary from 0.4 to ~ 1, as shown 
in Fig. [6] The relatively large contribution from Ao,m means 
that the variance is strongly dominated by the single combi- 
nation of mass scatters in the direction of PCo,m- As seen in 
Fig. [6] bottom, Ao,M, Xi,m , A2,m together comprise almost 
all the variance for most clusters. The presence of some mass 
measurement methods with small scatter suggests that there 
are some directions of the combined measurement methods 
which would also have small scatter and thus small Xi,M, and 
this is seen in the much smaller values of Xa,m and some- 
times A3, a/. The distribution of covariance matrices shown 
in Fig. [5] determine the A^a/ when combined with the rela- 
tion of the covariances to each other, cluster by cluster. The 
combination of measurement methods in PCa,m which has 
the smallest variance, Xa,u in our case, is also interesting, 
we return to this in 35] 
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Figure 6. (Top) Peaks from right to left: fraction of covariance 



0.7. 



(i.e. scatter) ^f, ^f-, etc. On average ^f- 

(Bottom) Peaks from left to right: fraction of covariance from 
scatter in direction of PCo mi PCq Af or PC\ m> etc. Most of the 
scatter is in the directions spanned by PCq^m i PC\ m > PC2,M > 
with a substantial fraction in the direction of the largest scatter. 



The sum and product of the Xi,M f° r all clusters are 
shown on a logarithmic plot in Fig. [7] The sum of scatters 
can vary by a factor of ~ 30 from cluster to cluster, and tends 
to be dominated by Ao,m, while the product of the Xi,M can 
be made very small (its size varies by over 10 7 ) if some 
directions, especially PCa.m, have very little scatter. In Sj4] 
below, these scalar properties of the cluster mass scatter will 
be compared to physical cluster properties such as triaxiality 
and mass. 

Turning to the PC vectors, many clusters have similar 
PCo,m (i.e. the combination of mass scatters that domi- 
nates is similar for many of the clusters). To quantify this 
more generally, we took PCi t M,minsq as the direction which 
minimizes (PCi t M • PCi,M,m,insq) 2 for the full ensemble of 
243 clusters (PCo,M,minsq is shown in the first line of Table 
[2}. Fewer than 20 percent of the clusters have their PCo,m 
pointing more than 45° away from PCo,M,minsq', about 25 
percent have their PCi : m pointing more than 45° away from 
PCi t M,minsq- We found that for almost all the clusters the 
projection upon PCi : M,minsq is close in size to the projec- 
tion on PCo,M,minsq, (i.e. 0.62 correlation). Continuing to 
PCa,Mi the direction of least scatter, ~ 70 percent of the 
clusters are within 45° of PC/t t M,minsq given in Table [2] 
This minimum scatter direction is (not surprisingly) domi- 
nated by the eN ph ,esz directions, since these are the mass 
measurement methods with the smallest scatter in our sam- 
pleB 

The similar forms of the Pd suggest that the mass 



15 Again recall these particular coefficients do not carry over di- 
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Figure 7. Sum and product of covariances for all clusters: the 
sum is dominated by the eigenvalue in the direction of largest 
scatter, Xq,m, while the product, related to the volume in the 
space of scatters, can be made very small by small values of e.g. 
-^4,M- The sum of eigenvalues peaks at ~ 0.3, the product peaks 
around 10 — 8 . 



scatter combinations they correspond to might have similar 
physical origins. For 75 percent of the clusters, the coeffi- 
cients of PCq,m are also all the same sign, that is that the 
dominant combination of scatter also has all the scatters in- 
creasing together relative to their average values, but 16/243 
have large (< —0.1) opposite sign coefficients for some mass 
measurement methods. As a large fraction of the variance is 
captured by PCo,m , as seen in the large values of A ,m / Yl -V 
the coefficient of PCo,m for the any given line of sight is in- 
dicative of the size of scatter from the average along that 
line of sight. 

The correlations of the five mass measurement meth- 
ods M bs,i with the PCj coefficients af, for all clusters, 
are shown in Fig. [8] For the ensemble of cluster measure- 
ments, the largest correlation with ao, i.e. with the coeffi- 
cient of PCq,m, is for velocity dispersions (in part because 
most of the scatter is due to velocity dispersions). Correla- 
tions of Oq with the other mass measurement methods are 
relatively smaller, and of similar size to each other. Tak- 
ing instead the fraction of the mass scatter vector due to 
PCo, i.e., ao/\/S - ( a ?) 2 > the correlations are weaker. In 
addition, weaker correlations arise with the coefficients of 
PCi,m, and for the direction with the smallest direction of 
scatter, PCa,m, only Mjv ph has a noticeable correlation on 
average. 

The direction of PCo,m can also be compared to that of 
M aV e/Mt r ue: the average inner product ~ 0.7, but the peak 



rectly to observations, most importantly because they are based 
on scatter in P(M os t|Mtr Ue ) and not vice versa. 
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Directions 
PCo 

C() , M , m i n s q 
PC>0,M,total 

PC 

,M , total, massive 

PC 4 

PC A, M,minsq 
PC'4 i M,total 
PC A, M, total, massive 

Table 2. A comparison between different characteristic di- 
rections in the space of mass scatters, (eN rod , eN ph , esz, e V el, 
£wl)- The direction which minimizes the dot product squared 
with PC^m of an 243 clusters is PC it u,minsq- PC i, M , total 
are the vectors found by PCA analysis of all observed cluster 
masses (normalized by Mtrue) and all lines of sight, jointly, dis- 
cussed in [T5] PCi m, total, massive restricts to the 70 clusters with 
M > 2 X 1O 14 /i _1 M . These vectors are derived from our esti- 
mates of M es t (Mtrue), which as mentioned earlier, neglects some 
systematics for WL and SZ and relies upon our simulation cali- 
brated mass definitions in terms of observables. 
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Figure 8. The correlations between the observed mass scatters 
and their projected values on PCi m- Velocity dispersions are 
very strongly correlated with the direction of the largest scatter, 
PCq m m part because the component of PCq m m the velocity 
dispersion direction, j3 ve i q, 



tends to be large. The dotted line 



marks the median value of the correlations. Note that the y-axis 
scale varies with mass measurement method and PCi, the x-axis 
does not. 



is closer to 0.8, and there is a broad range of values. (The 
rest of the vector ^ avc seems to lie in the PCi,m ,PCi,m 
plane). That is, the direction of largest mass measured is 
close to that of largest scatter; presumably this is because 
this direction has large scatters generally. 

We now compare the above quantities to line of sight 
dependent cluster properties. We will follow this in section 
[f4] with properties depending on the entire cluster rather 
than a given line of sight. 



3.3 Relation to cluster line of sight properties 

To get more understanding of the PCA decomposition, we 
compare values of PC quantities along lines of sight to clus- 
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ter properties along those lines of sight. The PCA line of 
sight dependent properties we consider are the coefficients 

af of the PCi,M, a ? / ( a ?) 2 (the fraction of scatter in 



different Pd directions, i and j each run from to 4), and 
the total scatter for a given line of sight Q^(af ) 2 /Ai,m)- We 
correlate these with the angle o b B between the line of sight 
and six specific physical cluster directions, listed below. 

Previously, for this simulated data set, increased mass 
scatter was found by ob serving along certai n physical clus - 
ter directions by WCS; iNoh fc Cohrj (|201ll l; ICohnl (|201lf ). 
Our extension using PCA includes six physical directions: 
the long axis £ of the cluster, calculated using the dark mat- 
ter particles in the simulation with a FoF finder as men- 
tioned above, the plane normal containing the most halo 
mass rimass, or conne cted filament mass r im centered on the 
cluster (see 3231 and INoh fc Cohnl l|201l[ ) for more details), 
the direction of the largest subgroup of galaxies r su b which 
originated from the same infall host halo (see 32.31) . and the 
velocity direction v sn ^ of this largest subgroupl^fl Many of 
these special clus t er dir e ctions are similar to each other, e.g. 
iKasun fc Evrardl (120051 ); I White. Cohn fc Smitl (|2010h ; ICohn1 
I 201lf ). as expected. 

The correlation of af / ^J^2j{af) 2 (the fraction of the 

mass scatter in the PC^m direction, for the line of sight 
indexed by a) with | cos# (, s | for each observation is shown 
in Fig. [9] The medians and the averages of these correlation 
coefficient distributions are shown in Table [3] The largest 
average correlations of angles with the physical cluster axes 
are with PCo.m, the direction of the combination of mass 
scatters that dominates the scatter. The average correlations 
with PCi,m are slightly smaller, for the rest of the Pd : M 
the average correlations tend to zero (as can be seen, the in- 
dividual clusters can have larger correlations). For PCo,m, 
the largest correlation is with the direction of the cluster 
long axis I. The next largest signals are with the direction 
of the mass plane normal n mass , filament plane normal nm, 
and the direction of the largest substructure r su b- The ve- 
locity of the largest substructure t) su b and the direction of 
the most massive filament fm are more weakly correlated. 
That is, scatter dominated by the mass scatter combination 
in PCo.m tends to occur more often when the direction of 
observation is more aligned with the long axis of the cluster. 
(As the other cluster directions are not linearly independent, 
strong correlations with them are possible and seen as well.) 
As most of the scatter occurs along PCo,m and PC\ : m, 
both of which are most correlated with looking along the 
long axis, it suggests that most of the scatter is due to look- 
ing along the long axis I. 

The correlation with ag, the full contribution from 
PCo,m, rather than the fractional contribution above, was 
much weaker. 

One other line of sight quantity, X^i( a ?) 2 / ^i,M (the 
weighted scatter of M" bs ), also tends to have correlations 



16 We also measured correlations with another direction depen- 
dent quantit y, the amount of subst r uctur e found via the Dressler- 
Schectman IIDres slcr fc Shcctman 1988) test, as described in 
IWhite, Cohn fc Smrd | |201(} ); ICohnl feOllf) ; however, the correla- 
tions with PCq m were much weaker with this directional depen- 
dent quantity than with the ones reported here. 
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Figure 9. Correlations between the fraction of the line of sight 
mass scatter in the PCo,M direction and |cos0 o b s |. The cluster 
physical directions are £, the long axis of the cluster, n mass , the 
direction perpendicular to the mass disk with radius 10 h~ 1 Mpc 
and width 3 h~ 1 Mpc centered on the cluster, containing the ma- 
jority of the mass in halos, n^i, similar to n mass in shape and vol- 
ume, but oriented to contain the majority of the filamentary mass 
ending on the cluster, f^\, the direction to the most massive halo 
filament surrounding the cluster, and tisubi^subi the relative ve- 
locity and direction of the largest galaxy subgroup in the cluster. 
See text and i|2.3l for more detailed definitions. Only the 227 clus- 
ters which are filament endpoints are included in the comparisons 
concerning filament planes. Vertical dotted lines denote medians, 
and the averages and medians are shown in Table [3] The largest 
contributions to PCq, for most clusters, seem to come when ob- 
serving along the long axis of the cluster, but there is a wide 
scatter. The correlations with the mass and filament planes are 
maximized for observations along the planes, i.e. perpendicular 
to the normal vectors n mass ,nfli, as expected. 



with special cluster directions as seen in Fig. [TO] The largest 
fraction of clusters with correlations > 0.20 occurs with the 
long axis 

To summarize this section, each cluster's line of sight 
mass scatter variations and correlations were analyzed and 



17 Correlations with the cluster long axis and exter- 
nal filaments or nearby clusters, both possible causes of 
mass scatters, have been seen in many other works too 



Ivan de Wevea ert fc Bertschingerl jl996h; ISplinter et al 



IColberg et alj jl999h: [Chambers, Mclott , 
[bOOOh; lOnuora fc Thomas! ll200olh iFaltenbacher et alT 
Ivan de Weygaertj (12002^1 : |Hopkins. Bahcall 



Millci 



2002 
2004 
2005 



Bailin fc Steinmetj i bo o; ifh | Fa ltenba cher et al.. 
Kasun"' fc *FjvrardT ~ll2005h: iBasilakos et al] d2006h: I Lee fc Evrardl 
j2007h; iLee et alj [20081); |Paz. Stasvszvn fc Padillal feOOSl) 



IPereira. Bryan fc Gil d2008h : iRagone- r igueroa fc Plionie 



l|2007f ); ICosta-Duarte. Sodre. fc Durretl iboill) ; Iceccarelli et al' 
POIII ): IPaz et al] l|201ll l. correlations with the long axis 
and mass scatte rs specifically ha ve bee n discussed re- 
cently in, e.g. , Becker & Kravtsov (2011); iMarrone et alj 
(1201211; iBattaglia. Bond. Pfrommer fc Sieverj d201llh 

iBahe. McCarthy fc Kind j2012ft ; iFeroz fc Hobsonl <2012ft . 
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PC ,M 


pc 1jM 


PC 2 ,M 


PC 3 ,M 


PCi, M 


^ sub 


0.35(0.31) 


0.18(0.19) 


0.01(0.02) 


0.02(0.03) 


0.03(0.04) 


^sub 


0.21(0.22) 


0.08(0.10) 


0.00(0.01) 


0.02(0.02) 


0.01(0.01) 




0.28(0.24) 


0.16(0.15) 


0.03(0.04) 


0.00(0.01) 


0.00(0.00) 




-0.37(-0.29) 


-0.22(-0.21) 


-0.08(-0.08) 


-0.06(-0.04) 


0.01(0.02) 


Tlmass 


-0.38(-0.32) 


-0.24(-0.21) 


-0.07(-0.07) 


-0.04(-0.04) 


0.03(0.02) 


h 


0.56(0.50) 


0.34(0.33) 


-0.01(0.02) 


0.04(0.06) 


0.05(0.05) 



Table 3. The median of correlation coefficients between the fraction of the line of sight mass scatter covariance in each PC'i m direction 
and | cos dobs I- The average is shown in the parenthesis. The full distribution is shown in Fig. [9] 




Figure 10. Correlations of ^(af ) 2 Ai,M ( tne 

weighted scatter 

of M" bs ) with angle between line of sight and various cluster axes, 
as described in the text and in Fig. [9] Vertical lines are at average 
value. 



characterized separately using PCA. Both similarities (simi- 
lar large scatter directions) and differences (raw size of scat- 
ter, amount of variation with line of sight) were found. The 
scatter was usually dominated (large Ao,a//X^) by one 
combination of mass scatters, which also tended to become 
an increasing component of the total mass scatter as the an- 
gle of observation became more aligned along the cluster's 
long axis. 



4 CLUSTER TO CLUSTER VARIATIONS 

Now we turn to line-of-sight independent properties, PCA 
related or otherwise. The scatters of each cluster are also 
characterized by several numbers, i.e. scalars, which are not 
dependent upon the line of sight of observation (e.g. Ao). 
Here we take some characteristic mass scatter scalars for 
each cluster and compare them with other cluster proper- 
ties, intrinsic to the cluster or due to its environment, com- 
bining to give 24 quantities in total. We make use of severa l 
quant i ties o btained previously (WCS; iNoh fc Cohnl (|201 ll ); 
ICohnl f20n])) for this simulation and described in £}231 We 
use correlations rather than covariances, to take out the di- 
mensional dependence. After considering properties of the 



pairwise correlations in £14.21 we consider the full ensemble 
of correlations using PCA in £|4.3I 

4.1 Cluster quantities 

For each cluster we consider the following (note that mass 
scatters are for M obs = M cst — (M cst ) = M cst — M ave ): 

• AMi = (MaveA — Mtruc) /A/true, observed average mass 
offsets for each of the five observables considered earlier: red 
galaxy richness, phase richness, SZ, velocity dispersions and 
weak lensing respectively. Note that these correlations are 
identical to those of M aV e,i/MtYue- 

• | cos9 ,minaq\, i.e. \PC ,m ■ PC 0l M,minaq\, the angle be- 
tween the largest mass scatter direction for each cluster, 
PCo,m, and the direction PCo,M,minsq, given in Table [2] 
which minimizes the dot product with PCq,m for all the 
clusters. 

• "^2 A, the sum of the cluster's mass scatters in £13.21 

• Yl A, the product of the cluster's scatters in H3.2I 

• -yjx fraction of total (mass scatter) variance along di- 
rection of largest variance in H3.2I 

• Ai,Af, variance along the direction with second largest 
mass scatter variance in £13.21 

• A4,m, variance along the direction with smallest mass 
scatter variance in £13.21 

• T = 4 — 4, triaxiality, where the h,h,l3 are the axes 
of cluster, calculated using the dark matter particles in the 
FoF halo. 

• S = ^ (h > h > h), sphericity. 

• /M fplano , the fractional connected filamentary mass in 
the local plane around clusters defined in £|2.3I 

• /M hplanc , the fractional halo mass in the local plane 
around clusters, see £|2.3I 

• Msphcro, the mass in halos above 5 x 10 13 /i -1 Mq within 
a 10 h~ Mpc radius sphere of the cluster. We used the sum 
of large halo masses in a 10 h~ 1 Mpc radius sphere around 
the central cluster rather than the total mass because the 
former is already kno wn to be correlated with the mass of 
the central halo (e.g-. lNoh fc Cohnl (|201ll )). 

• Aftrue, the cluster (FoF b = 0.168) mass (also called M). 

• ,fj? Bub , the fractional richness of the largest galaxy sub- 
group (see £|2.3|1 . 

• fn sub , the ratio of the distance to the largest galaxy 
subgroup in the cluster to the length of the longest cluster 
axis (see ij2.3jl . 

• c v i r , the concentration (not scatter from mean concen- 
tration of a given mass), from fit ting all of the Friends of 
Frien ds halo particles to an NFW (|Navarro. Frenk fc White] 
Il997l ) profile. This was fou nd to be very i mpor- 
tant in the previous studies (|jeeson-Daniel et al.l 1201 ll ; 
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ISkibba fc Maccid l201ll ; lEinasto etafl l201ll ) inspiring this 
work. 

• ti:3, the time of most recent >1:3 merger (often taken 
as the threshold for a merger-driven starbust) 

• tmo, the time of most recent >1:10 merger (often taken 
as a threshold for merger-driven AGN feeding) 

• x g, the distance between the central galaxy position 
and the average of the galaxy p ositions. This is similar t o 
the "relaxedness" considered by ISkibba fc Maccid l|201lf ). 
but in that case they use the offset between the most bound 
particle and the halo center of mass. We also considered 
scaling by M 1 / 3 , the results did not change significantly. 

• cos 9a,o, the cosine of the angle between PCo and M avo , 
i.e., the largest mass scatter direction vs. the average mass 
offset direction. 

We thus have a space of dimension Af met hod = 24 for the 
correlation and PCA analysis below. Several of these quan- 
tities might be expected to be related. For instance, spheric- 
ity and triaxiality both characterize departures from perfect 
spheres, but triaxiality measures prolateness and oblateness 
while sphericity is sensitive to flatness. Similarly the sum 
and product of the eigenvalues probe the size of the largest 
eigenvalue and in principle how the largest and smallest 
eigenvalue change together, respectively. A priori, it isn't 
clear which of our large set of quantities have the strongest 
or most illuminating relations to each other, so we start with 
a large set. 
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4.2 Correlations 

Fig. [11] summarizes the pairwise correlations. For simplic- 
ity, only correlations which have absolute value > 0.2, 
in six ranges, ±0.2, ±0.35, ±0.5, are shown. The correla- 
tions of measurements with themselves are omitted. Filled 
dark (open red) symbols are positive (negative) correlations. 
Properties in the list are grouped by type: offsets of cluster 
average mass measurements from their true mass ("AM", 
defined in £14. 1|) . PCA related scalars for each cluster from 
H3.2\ ("\m"). cluster environment or shape ("physl"), cluster 
history ("phys2"), and |cos#a,o|. Relations between mea- 
surements of the same sort (e.g. concentration and time of 
last merger, etc.) can be seen in the diagonal boxes. 

The off-diagonal boxes thus correlate different sorts of 
cluster properties. We concentrate on these. The first thing 
to notice is that many large correlations are visible. We start 
with correlations with AM, each cluster's five average frac- 
tional mass measurement offsets, corresponding to the five 
measurement methods. The different components of AM 
often have similar correlations with other quantities. In par- 
ticular, they are all strongly correlated with sphericity, most 
are anticorrelated with triaxiality and all are anticorrelated 
with the offset of the average galaxy position. All but the 
weak lensing mass offset are also correlated with the smallest 
mass variance \a,m, that is, when the average mass offsets 
AMi are large, the variance in the direction of minimum 
scatter around the average measured value tends to be large 
as well: the line of sight averaged scatter often is increasing 
for clusters with larger mass scatters around these averages. 
There is also a trend of these average mass offsets being anti- 
correlated with signatures of relaxedness (fractional richness 



Figure 11. Correlations between cluster properties: 0.5 < corr 
(filled hexagons), 0.35 < corr < 0.5 (filled squares), 0.2 < corr < 
0.35(filled triangles), opposite signs are open versions of the same 
symbols, i.e. corr < —0.5 (open hexagon), etc. Auto-correlations 
are not shown. Cluster properties are described in section ^. II A 
blank space means that a correlation has absolute value < 0.2. 
The horizontal and vertical lines distinguish the types of cluster 
properties. AAf labels the five components of AM. Am refers to 
measurements for that cluster's mass scatter correlations, "physl" 
refers to cluster properties which are environmental or shape re- 
lated. "phys2" refers to cluster properties more related to sub- 
structure or merging (concentration, distance to largest substruc- 
ture, etc.), Ao refers to dot product or correlation with PCq m- 
Only half of the correlations are shown, as they are symmetric 
across the diagonal axis. 



of the largest subgroup, offset of the average galaxy position 
relative to the galaxy center, etc.). 

Turning to other correlations with the mass scatter vari- 
ances, an increase in the mass in halos around the clus- 
ter tends to be accompanied by an increase many of the 
measures of mass scatter, i.e. properties associated with the 
A;,m. This might indicate the presence of a supercluster. The 
size of the smallest mass scatter variance \a,m also seems to 
increase with increasing fractional halo mass in the plane of 
the cluster, fu hplanc ■ 

Lastly, triaxiality is correlated with indicators of sub- 
structure, (the offset of the galaxy average position from 
the central galaxy and the fractional richness of the largest 
galaxy subgroup), while in contrast sphericity is anticorre- 
lated with these, as well with the fractional distance of the 
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Figure 12. Correlation coefficients between the cluster proper- 
ties listed in i|4.1l and their projection on the first four principal 
components. Horizontal dotted lines at ±0.4 are to guide the eye 
to larger positive or negative correlations. The top box is for PCo, 
the second is for PCi and so on; the expansion of the PCi vectors 
is given by these correlations divided by the A^ eigenvalues, 5.23, 
3.0, 2.07, 1.94 respectively for PCo , PG\ , PCi , PCs ■ 



largest galaxy subgroup from the cluster center. Physically, 
one might expect a rich subgroup in the cluster to lie along 
the long axis of the cluster (e.g. coming from a filament 
feeding matter into the cluster) and be correlated with a 
lengthened (more triaxial) cluster as a result. 



4.3 PCA 

By using PCA on these pairwise correlations, more gen- 
eral groupings can be investigated. This application of 
PCA to our sample is an e xtension of PCA of severa l 
cluster properties studi ed by iJeeson-Daniel et al.l (l201ll ); 
ISkibba fc Maccid (|201lh and by lEinasto et all (|201lh for 
supercluster s, which inspired t hi s stu d y. Properties cor- 
relate d by IJeeson-Daniel et"aH (120111 ); ISkibba fc Maccid 
(2011) included virial mass, concentration, age, relaxedness, 
sphericity, triaxiality , subs tructure, spin, and environment. 
I Jeeson- Daniel et alj (|201lh used 1867 halos from several 
boxes with masses ranging from ~ 10 11 h~ x Mq up and found 
that the domi nant PC vector wa s most correlated with halo 
concentration. ISkibba fc Maccid (|201ll ) used several boxes 
with halos with masses > 10 10 /i -1 Mq, and also found halo 
concentration to be important for clusters, as well as halo 
mass and degree of relaxedness0 Our sample is closest to 
the high mass tail of these samples. 



The 24 PC eigenvalue^ are A</ £ A=(0.22, 0.12, 0.09, 
0.08, 0.07, 0.05 (3 times), 0.04, 0.03 (4 times), 0.02 (3 
times), 0.01 (6 times), < 0.005 (2 times)). Unlike the case 
of mass scatter, where Ao,m is relatively large, (on average 
Ai,Af/Ao,Af < 0.4), here A1/A0 ~ 0.6, and (Ai, A2, A3, A4) = 
(5.23, 3.00, 2.07, 1.95), fairly close to each other. This makes 
interpretation less straightforward. However, subsets of 
properties with strong correlations with individual Pd may 
indicate these properties change together. To better identify 
relations, one can take these subsets and do PCA on the 
correlations within this subset alone. An example is given 
below. The expansion coefficients of the Pd for the differ- 
ent measured quantities, the /3% in Eq. [T] are given by the 
correlations divided by the eigenvalues. (For example, the 
expansion coefficients of the PCo in the order listed in H4.ll 
are (0.19, 0.26, 0.34, 0.35, 0.30, 0.03, 0.23, 0.16, 0.06, 0.19, 
0.22, -0.20, 0.35, 0.09, 0.10, 0.11, -0.11, -0.22, -0.14, 0.05, 
-0.06, -0.03, -0.33, -0.12).) 

The strongest correlation is between the velocity disper- 
sion mass offset AMy c i and PCo (~0.8), 9 other properties 
have correlations with absolute value above or equal to 0.7 
with at least one of the Pd (i < 6). Considering smaller 
correlations, 21/24 of the properties have at least one corre- 
lation of absolute value > 0.5 with one of the Pd, i < 11, 
and all properties have at least one correlation > 0.4 with 
at least one Pd. The correlations of the first four princi- 
pal component vectors with the 24 properties are shown in 
Fig. 1121 Lines are drawn at ±0.4 to guide the eye to the larger 
correlations. The sign of the correlations with the eigenvec- 
tors depends upon the overall sign of the eigenvectors, which 
is arbitrary, but the relative signs of the correlations of the 
different properties with each eigenvector are not arbitrary. 

We begin with PCo, the direction of largest variation. 
The AM/s, the sum of the mass scatter variances £ A, the 
individual mass scatter variances \i,m , )\4,ai and sphericity 
S all have large correlations (> 0.4) with their projections 
onto PCo. These correlations are in the opposite sense of 
those of T, fii Bub and x f / (these latter three might indicate 
less relaxed halos). A relation between large triaxiality and 
an increased offset in the average galaxy position or the 
presence a rich subgroup is somewhat intuitive, but the sign 
of the relation to the average mass measured for the cluster is 
surprising. One interpretation of the relation between AMi 
and T seen both here and in the correlations is that clusters 
which are more triaxial have fewer lines of sight along the 
elongated and presumably large measured mass direction, 
so that fewer lines of sight re sult in a large Mm t/Mt IUB — 1. 
See also related discussion in lRasia et alj (|2012)r°l 

To go further, we considered the subset of prop- 
erties which correlate strongly with PCo (AMi,S,T, 
/«™bi I o//iE'' l i Ai.MjA^m) and applied PCA to their in- 
tercorrelations. The trends with PCo for these quantities 
appear with the direction PCo, subset as well. In this case, 



18 For superclusters in SDSS DRY, lEinasto et"aH feOllft consid- 
ered weighted luminosity, volume, diameter, density of the highest 
density peak of galaxies and the number of galaxies, as well as 
shape parameters, and used the two largest PCi to find scaling 
relations. They divided up the sample into two sets of superclus- 
ters based upon where they lay in the planes spanned by pairs of 
PC ,PCi,PC 2 - 



9 Some of the previous analyses consider logarithms of the scalar 
quantities, we redid the analysis taking logarithms of A, FJ A, 
Al,Afi M,M, /AI fplanc , /A/ fepia „ c , /flsub, /o sub ! c ) Msphere! M and 
found very similar results for correlations with the first 2 PCi as 
below, for PC 2, correlations roughly increased for the merger 
related quantities, and decreased for the Mg p h ere and plane envi- 
ronment quantities. 

20 We thank E. Rasia for discussions about this. 
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however, A , su b sct is relatively larger, Ao, su bset/ J2 A = 0.39. 
To help in understanding, we took three groups of prop- 
erties (1) AMi (except AA/wl); (2) the mass scatter mea- 
sures J2 A, Ai,m, A 4 ,m; and (3) T,f Rsub ,x of f , S, AM W l - The 
properties in (1) and (2) have large same sign correlations 
with PCo- For (3), sphericity S and average weak lensing 
mass offset AMwl have this sign correlation as well, while 
T, fR Sub ,x f f have the opposite sign. These three groups 
(roughly) can be seen in correlations with PCi )Su bset (0.17 of 
the sum of eigenvalues), but in this case the relative behav- 
ior between properties in (2) and (3) reverses (i.e. increased 
mass scatter now has the opposite correlation with PCi 
as increased sphericity does), and changes in (1) all have 
correlations < 0.07 with PCi jSu bset- (Note that the general 
orthogonality of the PCi means relations between proper- 
ties, i.e. relative sizes and signs of correlations, do have to 
change from from one Pd vector to another.) The corre- 
lation strengths change as well, going from PCo to PCi, 
increasing for (2) and decreasing (below 0.4) for (3). Con- 
sidering the trends in broad brush, for a given cluster, large 
measured average (over lines of sight) mass offsets from true 
mass, aside from weak lensing, seem to tend to come with 
high sphericity and large mass measurement scatters, and 
low triaxiality, fn mb ,Xoff- That is, when a cluster's mass 
offsets are very large, there is a tendency for the vector 
of all its properties to be lying far along PCo, subset, im- 
plying trends for the other properties. When average mass 
offsets (besides AMwl) are small, it is not clear whether 
a given cluster's variations are lying along PCo, sub set or 
PC\ , subset or some other direction. As the relation between 
large sphericity (and small triaxiality, fii sub ,Xoff), i.e., (3), 
and the size of mass measurement scatters, i.e., (2), reverses 
between these two PCA vectors, small mass offsets makes it 
difficult to estimate the relation between them (i.e., between 
(2) and (3)). 

Continuing with the full set of 24 properties, and con- 
sidering correlations with PCi, the large sphericity /small 
triaxiality, (relatively) small richness in biggest subgroup, 
(relatively) small distance of largest subgroup from cen- 
ter and small offset of average galaxy positions with cen- 
tral galaxy position relation trends are also seen, but more 
weakly. Fluctuations in the direction of PCi have an in- 
crease in \PCq,m ■ PCo,M,minsq\ tied to a decrease in overall 
mass scatter (^ A, FJ A, Xi.m , A4.A/) and a decrease in mass 
in nearby large halos Msphere- This last perhaps indicates 
that the alignment of PCo, u is better, and the mass scat- 
ter smaller, when the cluster is not in a supercluster. Low 
concentration and relative distance of the largest subgroup 
from the cluster center are accompanied by earlier major 
mergers for variations in the direction of PC 2- The fraction 
of mass in the filament and mass planes seem to change 
together for changes along PC3, along with the amount of 
mass in nearby large halos. Again, these last 3 properties 
can be studied alone via PCA. In this case the associated 
Ao, subset/ X] A = 0.65. Note this is a different subset of PC 
vectors than considered earlier, and corresponds to all three 
quantities, normalized by covariance, changing by about the 
same amount with the same sign. This may imply that clus- 
ters which are in a richer environment, that is, with more 
nearby halos, may have the mass around them distributed 
in a more planar shape. 

We experimented with a larger set of properties than 



shown here, including for example the quantities shown in 
the first column of Fig. [9] for each cluster, i.e. the correla- 
tions with projections on PCo, m and physical directions. In 
this case the correlations with PCo for the other quantities 
remained essentially as shown in the top line of Fig. [12] but 
the PCo,M-long axis projections and the PCo,M-direction 
of largest subgroup projections had large correlations with 
PCo in the same sense as T, cluster triaxiality. 



5 AN ENSEMBLE OF CLUSTERS 

The previous sections considered cluster directional mass 
scatters and variations in properties, cluster to cluster. One 
can also consider the joint ensemble of mass measurements 
of all the simulated clusters and all their lines of sight, for all 
five methods. A comparison is interesting between the PC 
vectors of the individual clusters, for many lines of sight, 
and those for all the clusters together. 

5.1 Trends for all clusters considered together 

We take the union of the estimated masses of all clusters and 
all lines of sight (aside from those discarded as discussed 
in §2) and repeat the analysis of §3. Now M bs/M tr uc = 
Mest/Mtrue — (Mest/Mtrue). The average (M e st/Mt rU c) is over 
all clusters and lines of sight, and for our case is within a 
few percent of 1. The resulting covariance matrix for these 
mass scatters is shown in Table [4] The covariances for the 
combined sample are larger than the averages for individ- 
ual clusters, the latter are shown in Table [4] in parentheses 
for comparison, and were plotted individually in Fig. [5] The 
increased scatter is not unexpected as it is a combination 
of scatters from objects which are all themselves scattered 
around different central values. The corresponding PC eigen- 
values are A ilM , tota! =(0.31, 0.18, 0.094, 0.061, 0.022). The 
direction of largest scatter has \o,M,totai/ ~}2 ^ ~0.47, com- 
pared to the average of 0.66 for individual clusters shown 
in Fig. [(J] The total mass scatter variance ^ A = 0.67, 
which is relatively high compared to the average for in- 
dividual clusters; the product FJ A is also relatively larger 
(7xl0 -6 ), as to be expected from the increased covariances 
111 Table H The directions of largest scatters are sim- 
ilar to those for the clusters considered separately (Table 
[3J) , although PC \,m, total has a much smaller SZ component 
than PC4,M,minsq- The correlations with PCo,M,totai of the 
projections of the different mass observables (Mjv rod , Mn h , 
Msz, Mvel, M WL ) are (0.69, 0.57, 0.57, 0.79, 0.57), similar 
to the median of their individual cluster counterparts, (0.69, 
0.51, 0.48, 0.87, 0.47), shown in Fig. [8] The correlations are 
fairly large for the other Pd components, but these by def- 
inition contribute less to the total mass scatter. PCo,M,totai 
and AM are 23° apart, similar to their counterparts for the 
individual clusters. Again, just as for individual clusters, the 
closer one is to the long axis of the cluster, the larger the 

21 For the 70 clusters with mass > 2 X 10 14 h~ 1 MQ, the total 
scatter A goes down (to 0.44 from 0.67 for the sample with 
M > lO 14 h. _1 M ), and the direction of the PC ,M,total slightly 
rotates as seen in Tabled The fraction of variance in \o,m in- 
creases, i.e. the direction of the largest scatter has more of the 
scatter. 
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AfjV rod /A/t r ue Afjv ph /Mtiue Afsz/Aftrue Myel/Aftrue A/wL /Aftrue 

Mjv^/Mtrue 0.18 (0.08) 0.06 (0.02) 0.05 (0.04) 0.04 (0.03) 0.04 (0.03) 

M N h /M true 0.06 (0.02) 0.04 (0.02) 0.02 (0.01) 0.02 (0.01) 0.02 (0.01) 

Msz/Mt™ 0.05 (0.04) 0.02 (0.01) 0.09 (0.07) 0.04 (0.02) 0.04 (0.01) 

Afvd/Mtrue 0.04 (0.03) 0.02 (0.01) 0.04 (0.02) 0.24 (0.20) 0.05 (0.02) 

M WL /M t rue 0.04 (0.03) 0.02 (0.01) 0.04 (0.01) 0.05 (0.02) 0.11 (0.05) 

Table 4. Covariance matrix for full set of measurements A:f orjs /Aitrue, for all clusters, and, in parentheses, the average of the individual 
cluster covariances in Fig. [4] The median values for the the individual cluster covariances tend to be smaller than the average values. 
The covariances for the full sample tend to be larger than the averages cluster to cluster. 



fraction of scatter due to PCo t M,totai (~0.4 correlated, see, 
e.g. Fig. [9] for the individual cluster distribution). The next 
leading correlations of PCo,M,totai are with the position of 
the largest subgroup and the direction perpendicular to the 
cluster's dominant filamentary plane or mass plane. These 
trends were very close to those found for clusters individu- 
ally. These relations are still however based on the estimated 
mass as a function of true mass, and thus not immediately 
applicable to observational samples, as we now discuss. 

5.2 Future extensions to observational samples 

One can also consider using some variant of our PCA anal- 
ysis for an observational sample, which would have some 
range of true cluster masses and some observations, as we 
do in our box. However, our implementation of PCA doesn't 
directly carry over, and our measurement sample in hand is 
not appropriate. We discuss these limitations and possible 
ways forward here. 

Observations are concerned with Mtruc as a function 
of estimated mass, M true (M cs t). An observer does not have 
Aftrue- We have instead been calculating M cs t(M tr uc) and in 
addition dividing by Mtruc. Applying PCA in a way useful 
to observations (so that one can take 5 methods to measure 
mass of one cluster and compare to the PC vectors calcu- 
lated in simulation) requires PC vectors calculated, from 
simulation, for a representative range of Af cst and Mtrue, 



and a proxy for the unobservable Mt, 



As a proxy for 



Mtme, one possibility is the likelihood mass. This would 
come from simulations, which are already required to cal- 
ibrate covariances and offsets. Another possible mass proxy 
uses the principal component vectors directly, in principle 
encoding similar information. That is, one has from the def- 
inition of PC vectors, 



M°L 



Mtruc Mtruc 



M C! 



Mtr 



(5) 



The quantities aside from M tr uc are either mea- 
sured , e.g., MSjt, or calculated from simulations, e.g., 
{Meat /Mtruc), PCi,M- In particular, we have 



Pd. 



Mgt 

Mtruc 



M c: 



M tr 



= a 4 



(6) 



but the variance of a 4 is given by \ 4 ,m, total- If M,M,tot a l is 
very small then we can try the approximation 0,4 ~ , so 
that projecting on PC '4,M,totai gives the approximation 23 ! 



Ma 



PC 4,,M, total ■ { Mtrue ^ 



(7) 



The general use of this approach depends upon the size of 
\i,M,total for the system; the smaller \A,M,total is, the better 
it appears this approximation should work. As there tend to 
be some minus signs in PC 4, some catastrophic failures will 
occur where M4 <C M es t,i, for all i. 

However, the relevance of testing these possible M tr ue 
proxies on our sample seems limited. In particular, the 
P(M est |Af t ruc) for our sample (all M > 10 14 /t _1 M Q halos 
observed along ~ 96 lines of sight each) is not representative 
of any expected observed sample, nor is our P(Mt rtt e), which 
is required for likelihoods. Observing our few high mass clus- 
ters along many lines of sight is not a good approximation to 
observing many clusters along one line of sight: the few high 
mass clusters observed along many lines of sight in particu- 
lar do not well sample the realistic population of high mass 
clusters. There are no clusters appearing fewer than ~ 96 
times; in particular, rare high mass clusters which would be 
expected from the number of lower mass clusters "present" 
are missing. Halos with the same M os t (the only observable) 
but a lower A/true will also occur in an observational sam- 
ple, and might contribute differently to the scatter as well. 
It would also be important to include the neglected, in our 
simulation, line of sight larger scale scatter for SZ and weak 
lensing and the systematics mentioned above. These would 
be very interesting directions to pursue in future work. 

In summary, in this section we considered all the clus- 
ters in the box, along all lines of sight, to see how cluster- 
to-cluster variation altered mass scatter relations found in 
earlier sections for individual clusters; the trends remained 
but changed in strength. In particular, the dominant mass 
scatter combination, similar in form to that for many of the 
individual clusters, still seems to be more prevalent when 
looking down the long axis of the cluster or perpendicular 
to the mass or filament plane of the cluster. In the second 
subsection, we mentioned two possible methods for extend- 
ing our analysis which do not require prior knowledge of 



22 Using no proxy for Mtrue, i.e. doing PCA on M es t alone, dif- 
ferently weights the scatter of high and low mass clusters. In 
our sample, caveats below, most of the scatter then corresponds 
to all mass estimates increasing or decreasing together in equal 
amounts. Projections on this combination of mass scatters are 
weakly correlated with observations along the long axis of the 
cluster, and less correlated with other directions. 



23 An extreme and unphysical limit of this case would be if one 
measurement method had tiny scatter and no correlation with 
the other measurement methods. In this case PC 4 m total would 
be proportional to a measurement via this one method, and the 
last equation would basically give that the estimated mass in this 
method is the true mass; up to any overall biases that might exist, 
an unbiased mass estimator would make the denominator 1. 
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Mtrue, replacing it with the likelihood mass or a mass de- 
rived from PC a- These would be interesting to apply to an 
observational sample, but would need a very closely matched 
simulation. It would be interesting to see if the resulting PC 
vectors and projections on them by estimated mass measure- 
ment methods have correlations with cluster orientation or 
perhaps M proxy /M t rue ■ 



6 OUTLIERS 

We have focussed on general trends above, but not all clus- 
ters (or all lines of sight) fell on the general trends. We 
searched for properties of outliers or tails in the distribu- 
tions of the various quantities related to A^m, outliers in 
average cluster measured mass vs. true cluster mass, clus- 
ters with different maximum or minimum covariance mass 
measurement method pairs than the majority of clusters, 
and clusters where PCo,a/ had at least one opposite sign 
coefficient with absolute value > 0.1 (i.e. largest direction 
of scatter not corresponding to all mass scatters increasing 
together, mentioned earlier). 

The various outliers did not seem to follow any pattern. 
Some outliers were common to a cluster, e.g. sometimes a 
large A^m outlier occurred with a large \\ A, as might be 
expected, as A4,m is the smallest of the A^m- Some of the 
clusters which had PCo,m not aligned with the most likely 
PCo,M,minsq had much larger contributions to mass scatter 
from Compton decrement than the average cluster, often due 
to close massive halos, and Ao,m / X] ^ tending to be smaller 
than for the usual cluster. The clusters having different cor- 
relations of lines-of-sight properties with mass measurement 
methods don't seem to have a clear relation to the outliers 
in mass scatter properties. 



7 SUMMARY AND DISCUSSION 

The scatters between estimated and true cluster masses, for 
different observational methods, are often correlated. Un- 
derstanding these correlations is becoming more and more 
important as reliance on multiwavelength measurements in- 
creases. For instance, correlations and covariances in scat- 
ters affect both error estimates in multiple measurements 
of individual clusters an d produce a b ias in measurements 
of sta c ked objects (see Rvkoff et al.l 
(|2010h ; IWhite. Cohn fc Smitl (|201(t ): 



J2008I'); iStanek et all 
lAngulo et all (|2012T ) 
for detailed discussion). (Using the full covarian ce matrices 
has b e en done in some clu s ter analyses, e . g. by iRozo et al.l 
|2009l ); lMantz et al l (|201(]| l; lBenson et al.l l|201ll U We char- 
acterized the scatter and correlations of clusters in two ways. 

We started by considering clusters individually to iden- 
tify mass scatter properties due to line of sight effects for 
243 clusters in an N-body simulation, each along ~ 96 lines 
of sight. We used five observational mass proxies: red galaxy 
richness, phase space galaxy richness, Sunyaev-Zel'dovich 
flux, velocity dispersions and weak lensing. It would also 
be very interesting to include X-ray observation as well, but 
our attempt at a proxy, based on assigning fractional X- 
ray flux to cluster galaxy subgroups, was not illuminating. 
These are employed to find cluster masses in current and 
upcoming large scale cluster surveys. 



For each cluster, we characterized the "shape" and "vol- 
ume" of the mass scatters of M es t calculated as a function of 
Mtrue, using PCA, or Principal Component Analysis, to ob- 
tain a set of non-covariant measurements. Most clusters had 
one combination of observational mass scatters contribut- 
ing the majority of the mass scatter, and this combination 
was similar for many of the clusters, i.e. they had a similar 
largest principal component PCo,m- This scatter combina- 
tion was a larger fraction of the total line of sight mass 
scatter when the cluster was observed along the long axis of 
the cluster. Weaker relations with observations along other 
cluster intrinsic and environmental axes were seen. Identi- 
fying the long axis of course requires the clusters to have 
non-spherical shapes. In our case the cluster member dark 
matter particles were determined using the FoF finder with 
linking length b — 0.168. 

Individual cluster mass scatter properties due to line 
of sight effects were then compared to several intrinsic and 
environmental cluster properties, including triaxiality, pla- 
narity of filament or halo mass in the immediate neighbor- 
hood of the cluster and relative richness of largest galaxy 
subgroup within the cluster. For example, pairwise cor- 
relations, and their combined effects using PCA, showed 
that clusters with average mass measurements (over lines 
of sight) which are large tend to also have large mass scat- 
ter around their average, relatively high sphericity and small 
triaxiality, richness in the largest subgroup, and offset of the 
galaxy position average from the cluster center. Relations 
were also seen for other quantities such as concentration, re- 
cent major merger time and fraction of halo mass near the 
cluster within a 3 h~ 1 Mpc plane. 

Finally, instead of considering each cluster individually, 
we considered the sample of all clusters and all lines of sight 
together, and found that most of the trends for the anal- 
ysis of Most/Mtruo remained, albeit at different strengths. 
The projection on the direction of largest scatter was more 
weakly correlated with the observation angle relative to the 
cluster long axis. 

It is interesting to think about applying these methods 
directly to observation. One would need more information, 
in particular estimates of M t rue(M e st), rather than the op- 
posite which we have here. This requires calibrations from 
simulations which better sample an expected observational 
sample at the high mass end, and which also include es- 
timated masses from lower mass halos (as well as faith- 
fully reproducing the observational systematics and selec- 
tion function). Such a simulation would provide correlations 
and covariances and predicted likelihood masses Mu^e (and 
a variant, M4 considered above, based on a narrow direc- 
tion of scatter in our sample). PCA could then be applied 
to Most/Miiko or M es t/M4, rather than Most/Mtruo as we did 
here. It would be interesting to see if these PC vectors also 
had relations to cluster orientation such as we found, or per- 
haps other quantities such as Mn^ e /M tT ua- They might also 
give some idea of which follow up mass measurement meth- 
ods would together provide the most constraining power. 
One could for example identify the measurement method 
with potentially the least covariance with measurements al- 
ready in hand. More generally one could design a combina- 
tion of measurement methods with smaller covariant scatter 
(and hopefully smaller scatter as well) using calculated PC 
vectors as a guide, if the simulations are faithful enough. It 
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is a major challenge to accurately capture the systematics 
and selection function of observational surveys with numer- 
ical simulations. Another technical issue is to improve es- 
timates of the correlations and covariances, so that sets of 
inconsistent measurements can be more easily recognized. 
It would be very interesting to do such analyses on a larger 
box, and/or with other measurement methods. 
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